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In 1992 Carisen et al. reported a significant global decline in sperm density between 1938 and 
1990 [Evidence for Decreasing Quality of Semen during Last 50 Years. Br Med J 305:609-613 
(1992)]. W e subsequently published a reanalysis of the studies included by C arisen et al. [Swan et 
al. Have Sperm Densities Declined? A Reanalysis of G lobal T rend Data. Environ Health Perspect 
105:1228-1232 (1997)]. In that analysis we found significant declines in sperm density in the 
U nited States and Europe/Australia after controlling for abstinence time; age; percent of men with 
proven fertility, and specimen collection method. The declines in sperm density in the United 
States (approximately 1.5%/year) and Europe/Australia (approximately 3%/year) were somewhat 
greater than the average decline reported by C arisen et al. (approximately 1%/year). H owever, we 
found no decline in sperm density in non-Western countries, for which data were very limited. In 
the current study, we used similar methods to analyze an expanded set of studies. We added 47 
E nglish language studies published in 1934-1996 to those we had analyzed previously. T he aver¬ 
age decline in sperm count was virtually unchanged from that reported previously by C arisen et al. 
(slope = -0.94 vs. ~0.93). T he slopes in the three geographic groupings were also similar to those 
we reported earlier. In N orth America, the slope was somewhat less than the slope we had found 
for the United States (slope = '0.80; 95% confidence interval (Cl), -1.37--0.24). Similarly, the 
decline in Europe (slope = -2.35; C I, _ 3.66- _ 1.05) was somewhat less than reported previously. 
As before, studies from other countries showed no trend (slope = -0.21; Cl, -2.30-1.88). These 
results are consistent with those of C arisen et al. and our previous results, suggesting that the 
reported trends are not dependent on the particular studies included by C arisen et al. and that the 
observed trends previously reported for 1938-1990 are also seen in data from 1934-1996. Key 
words epidemiology, geography, regression analysis, semen quality, sperm density, trend. Environ 
H ealth Persped 108:961- 966 (2000). [O nline 5 September 2000] 
http://ehpnetl.niehsnih.gov/docsf2000/108p961-966awan/abstrad.html 


I n 1992 C arisen et al. (1) stated that 
...reports published worldwide indicate clearly 
that sperm density has declined appreciably dur¬ 
ing 1938-1990. 

Subsequently, this conclusion has been sup¬ 
ported by findings from some studies (2-4), 
but not by others (5-7). The critical issues 
raised concerning this study fall, broadly, 
into three categories Some authors suggested 
that poor or highly variable data invalidated 
any inference about trends in sperm counts 
(8,9). Others questioned the validity of the 
statistical methods used in this analysis 
(8,10,11). Bias due to changing study popu¬ 
lations (12) or confounding by factors such 
as age and abstinence time (time between 
sample collection and last ejaculation) were 
also suggested (4,8). 

We conducted several analyses designed 
to examine these concerns. The first, pub¬ 
lished in 1997, reanalyzed the studies used by 
C arisen et al. (1) to examine model selection, 
confounding, and selection bias (13). In that 
paper, we noted that estimates of mean sperm 
density from the U nited States and Europe 
declined somewhat more rapidly than had 
been reported by Carlsen et al. (1). In other 
parts of the world, where studies were few and 


most were quite recent, there was insufficient 
data to evaluate this question. W e also found 
that controlling for confounding bias, to the 
extent possible, provided additional support 
for the conclusions of Carlsen et al. (1) 
rather than reducing the estimated decline in 
sperm density. In the second analysis, pub¬ 
lished in 1999, we looked at sperm counting 
methods and the reliability of measurements 
from these historical studies (14). We found 
no evidence that counting methods had 
changed appreciably or that counts from 
older studies were less reliable than those 
from recent studies 

The current study extends our previous 
analyses in three ways First, we conducted 
an independent literature review to evaluate 
possible bias in the selection of studies used 
by Carlsen et al. (1). Second, we examined 
the robustness of the models utilized in that 
analysis (and ours) by applying these models 
to an expanded data set. Finally, we assessed 
the consistency of post-1990 data with 
trends in sperm density from studies pub¬ 
lished before 1990. 

Methods 

Analysis of Carlsen etal. study. Carlsen et al. 
(1) screened studies published from 1930 to 


mid-1990 to identify studies that included 
estimates of sperm density. They excluded 
studies that included men in infertile couples, 
men who were referred because of genital 
abnormalities, and studies that selected men 
on the basis of their sperm count. Studies 
that used nonmanual methods for counting 
sperm were also excluded. Carlsen et al. (1) 
included 61 studies published between 1938 
and 1990. The authors estimated the rate of 
change in mean sperm density as a function 
of publication year by fitting a simple regres¬ 
sion model. 

Current analysis The current analysis 
includes 54 of the 61 studies analyzed by 
Carlsen et al. (1). As in our previous paper 
(13), we excluded three non-English lan¬ 
guage studies (15-17) because it was not 
practical for us to systematically review the 
non-English language literature on this sub¬ 
ject. We also excluded two studies that 
included men who conceived only after an 
infertility work up (18,19), studies that did 
not meet the eligibility criteria of Carlsen et 
al. (1). Finally, we did not include any stud¬ 
ies with less than 10 subjects, which resulted 
in two additional exclusions (20,21). The 
most recent study in Carlsen et al.'s analysis 
(1) and our 1997 reanalysis (1) was published 
in J une 1990. To extend the study period, 
we conducted a search of M edline (N ational 
Library of M edicine, Bethesda, M D) for 
English-language studies published between 
1990 and 1996 and found 19 that met these 
eligibility criteria. We also conducted a less 
systematic search of the 60-year period 
1930-1990 and identified 28 additional eli¬ 
gible studies Therefore, the current analysis 
is based on 101 English-language studies 
published in 1934-1996 (54 "Carlsen" stud¬ 
ies and 47 "non-C arisen" studies), each with 
at least 10 men and all satisfying the eligibili¬ 
ty criteria published by Carlsen etal. (l).The 
47 "non-Carlsen" studies are summarized in 
theAppendix. 

Each of these 101 studies was reviewed 
independently by two of us to systematically 


Address correspondence to S.H . Swan, Department 
of Family and Community M edicine, U niversity of 
M issouri-Columbia, MA306 Medical Sciences 
Building, Columbia, MO 65212 USA. Telephone: 
(573) 882-3126. Fax: (573) 884-6172. E-mail: 
swans@health.missouri.edu 

We thank the reviewers for their helpful sugges¬ 
tions. 

Received 8 February 2000; accepted 1J une 2000. 


Environmental Health Perspectives • volume 1081 number 101 October 2000 


961 




Articles Swanetal. 


abstract detailed information on potential 
confounders and several measures of semen 
quality. These variables included mean (or 
median) sperm density, publication year, 
study location (state and country), study 
goal (to estimate population parameters, 
other), criteria for recruiting study subjects 
(proven fertility, prevasectomy, potential 
sperm donor, other), percent of men with 
proven fertility, semen collection method 
(masturbation into container, other, unspec¬ 
ified), sperm counting methods (manual, 
not reported), number of samples per indi¬ 
vidual, age (mean or range), and abstinence 
time (mean or range, protocol requirement if 
applicable). Information on the complete¬ 
ness of this information was also recorded. 

Previous analyses, including ours (23), 
have looked at the trend in sperm density as 
a function of publication year. However, 
because time of sample collection always pre¬ 
dated publication, often by several years, we 
decided to use the time of sample collection, 
or its estimate, rather than the year of study 
publication. For the 22 studies that reported 
the beginning and end of the sample collec¬ 
tion period, which often spanned several 
years, we used the midpoint to estimate the 
year of sample collection. The median lag 
time from the midpoint year to publication 
was 3 years for these studies. Therefore, 
when the dates of sample collection were 
unavailable, we subtracted 3 years from the 
publication year to estimate the year of sam¬ 
ple collection. Finally, to obtain intercepts 
that were more easily interpretable and to 
aid in convergence of more complex models, 
we subtracted 1,900 from the estimated year 
of samplecollection. 

The arithmetic mean sperm density was 
reported in all but six studies. For these six 
studies we estimated the difference between 
the arithmetic mean and the reported sum¬ 
mary measure (median or geometric mean) 
using data from studies for which multiple 
summary measures were available. For the 
five studies that reported median sperm den¬ 
sity only, we estimated the arithmetic mean 
by adding 12.0 to the median, whereas for 
the single study that reported only a geomet¬ 
ric mean, we added 22.7 to approximate the 
arithmetic mean. 

We followed an analysis strategy similar 
to the one we used previously (13). After 
conducting a simple linear regression, we 
stratified the 101 studies into three broad 
geographic groupings: North America (44 
studies, published 1934-1996), Europe (34 
studies, published 1949-1996), and other 
countries (23 studies, published 1978-1995). 
We then used multiple regression models 
(using procedures for linear and nonlinear 
regression as well as generalized linear models) 
to fit linear, step, spline, and quadratic models 


(22). In these models we included con- 
founders that were related to sperm density 
and/or year in univariate analyses Interactions 
between year and region, which can indicate 
geographic differences in the rates at which 
sperm density changed, were examined in all 
multiple regression models. To assess the 
extent to which each variable confounded the 
relationship between sperm density and year, 
we calculated the slope (in the model without 
interaction terms) with and without that vari¬ 
able included in the model. The magnitude of 
confounding is estimated by the degree of dis¬ 
crepancy between these two estimates (23). As 
with previous analyses, data from each study 
were weighted by the number of men includ¬ 
ed in that study, and sperm densities are given 
in unitsof 10 6 /mL. 

Results 

The estimated year of sample collection in 
these 101 studies ranged from 1931 to 1994 
(publication year 1934-1996). As shown in 
Table 1, the majority of new studies were 
published after 1980. M ean sperm density 
and mean publication year from studies with 
and without information about year(s) of 
sample collection did not differ appreciably. 

T he geographic distribution of these 101 
studies, representing 28 countries and 19 
states within the United States, was similar 
to that in previous analyses, but with a 
somewhat greater proportion of European 
studies (T able 2). W e made two changes in 

Table 1. Publication year of studies in three analyses. 


our geographic strata; the stratum we previ¬ 
ously labeled U nited States is now denoted as 
North America in order to include a (new) 
Canadian study. In addition, Australia, which 
was previously included with European stud¬ 
ies, is now included with "other countries" 

Simple linear model. For comparison 
with C arisen et al. (1), we first replicated 
their simple linear regression. As shown in 
Table 3, the slope for the regression line in 
the expanded data set (-0.94 x 10 6 mL/year; 
p < 0.0001) is very similar to that found for 
the original 61 studies (~0.93 x 10 6 mL/year; 
p < 0.0001). These estimates differ only 
slightly from the slope we reported in our 
1997 analysis (13): (-0.95 x 10 6 mL/year; p 
< 0.0001). T he fit of the regression line to 
thelOl data points is shown in Figure 1. 

Assessing confounding and interaction. 
T o select variables for our analysis, we initial¬ 
ly included all variables for which we had 
abstracted data and we noted the percent 
change in the slope that resulted when we 
removed them one at a time. Several of these 
were unrelated to sperm density or publica¬ 
tion year (change < 10%) and were dropped 
from further analysis. These variables were 
the number of samples per subject, whether 
the years of sample collection were reported, 
whether the arithmetic mean was reported, 
and purpose of the study. Although remov¬ 
ing age changed the slope by only 1.2%, we 
included this variable in the final model 
because it is a basic demographic variable 


Publication 



New studies in 

All studies in 

year 

Ca risen etal. (1) 

Swan etal. (13) 

current analysis 

currentanalysis 

1930-1959 

10(16%) 

8(14%) 

2(4%) 

18(1%) 

1960-1979 

17 (28%) 3 

14 (25% ) a 

3(6%) 

16(16%) 

1380-1989 

33 (54% ) a 

33 (59% f 

23(49%) 

55 (54%) 

1990-1996 

1(2%) 

1(2%) 

19(40%) 

20(20%) 

Total 

61 

56 

47 

101 

“Includes one study 

with <10 subjects tt 

latwas excluded fromthe current analysis. 


Table 2. Geographic distribution of studies in three analyses. 





New studies in 

All studies in 

Region 

Ca risen etal. (1) 

Swan etal. (13) 

currentanalysis 

currentanalysis 

North America 

28 (46% ) a 

27 (48% ) 3 

18(38%) 

43(43%) 

Europe 

17 (28% ) 6 

15(28%!** 

20(43%) 

35(35%) 

Other 

16(26%) 

14 (25%) 

9(19%) 

23 (23%) 

Total 

61 

56 

47 

101 

“Includes two studii 

ss with <10 men that were excluded from the current analysis. 


‘Includes one Australian study included 

in “other" in the current analysis. 


Table 3. Results of fitting a simple linear regression model 

in three analyses. 





Subset of Carlsen 

All studies in 

Factor 

Ca risen etal. (1) 

in currentanalysis 3 

currentanalysis 3 

Number 


61 

54 

101 

Publication years 


1938-1990 

1938-1990 

1934-1996 

Slope 


-0.93 

-0.95 

-0.94 

p-Valtjft 


<0.0001 

<0,0001 

<0.10.81. 

¥ 


0,36 

0.36 

0.22 


“Excludes non-English language studies and those with <10men. 
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often included in analyses of sperm density. 
The method of counting sperm was also 
included (although removing it changed the 
slope by only 6%). In this expanded set of 
studies, recruiting criteria and the percent of 
men with proven fertility were highly corre¬ 
lated, so only one of these variables (fertility) 
was retained for further analyses T he follow¬ 
ing variables were included in all subsequent 
multiple regression models: geographic 
region, age, abstinence time, percentage of 
men with proven fertility, method of count¬ 
ing sperm, and method of sample collection 
(Table 4). Of these, all but the method of 
counting sperm had been included in our 
previous analysis (13). Because one of the 
goals of this study was to examine the effect 
of adding new studies we also kept a variable 
that indicated whether the study had been 
included by Carisen et al. (1), even though 
removing it from the model had little effect 
on the slope. D espite the incompleteness of 
data on many covariates, the inclusion of the 
variables contained in Table 4 did improve 
model fit. W hen the simple linear model was 
compared with the multivariate linear model, 
including these covariates, the adjusted R 2 
increased from 0.22 to 0.59. 

In addition to including these covariates 
singly, we examined interaction terms to 
allow for different slopes in the three 


200 

180 



1930 1940 1950 1960 1970 1980 1990 


Year of sample collection 

Figure 1. M ean sperm density in 101 studies pub¬ 
lished 1934-1996 and simple regression line. 





Year of sample collection 

Figure 2. Interactive regression model for mean 
sperm density by year and geographic region, 
after controlling for proven fertility, abstinence 
time, age, specimen collection method, method of 
counting sperm, whether the study was included 
by Carisen et al. (1), and interaction of region and 
study year. 


geographic regions In our previous analysis 
(13), the three slopes that we estimated 
differed considerably (-1.50, -3.13, and 
+1.56, respectively, for the U nited States, 
Europe'Australia, and other countries). In 
the current analysis the European slope 
(-2.35) still differed from that for North 
American (difference in slopes-1.55; Cl, 
-2.90--0.21), indicating significant interac¬ 
tion (Figure 2). Although we did include the 
slope of the best fitting line for other coun¬ 
tries (-0.60), the fit to a linear model for data 
from these countries was not good and the 
confidence interval was very broad. G iven the 
limited data, there was no evidence that this 
slope differed appreciably from those from 
other regions (Table 5). 

Nonlinear models. We also fit a number 
of nonlinear models (quadratic, spline, and 
step) using the same set of covariates that 
were used for the linear model (Table 4). 
Olsen et al. (8) suggested that these models 
were preferable to Carlsen's simple linear 
model (I). In our 1997 analysis (13), we 
showed this was not the case, once geograph¬ 
ic region and the interaction of region and 
year were included in the model. 

I n our previous study (13), we had not 
seen any difference between the spline and 
linear models except a slight (nonsignificant) 
change in the U nited States post-1970 (from 
-1.52 to -1.47; p for spline term = 0.97). 
W hen a spline model was fit to the current 
expanded data set, the pre-1970 North 


American studies showed a somewhat 
steeper decline than those published after 
1970 (-0.93 vs. -0.55), although this dif¬ 
ference was still not significant (p for spline 
term =0.71). 

In our 1997 analysis (13), quadratic 
terms could not be estimated and we found 
no evidence of curvature within any of the 
three regions studied. I n the present analysis, 
it was possible to estimate the quadratic 
term, but its addition did not improve the fit 
of the model; the quadratic terms were negli¬ 
gible and none approached statistical signifi¬ 
cance. Thus, we again found no evidence to 
support either curvature, or a "leveling off" 
in the rate of decline in recent years 

In our 1997 analysis (13), we also fit a 
step function and found a significant post- 
1970 decrease in sperm density in all regions 
relative to pre-1970 data (which was entirely 
from the U nited States). Again, results were 
similar in the current analysis When a step 
function was fit, comparing the mean sperm 
density for North America before and after 
1970, a large step was seen (138 x 10 6 /mL 
vs. 113 x 10 6 /mL; p for difference < 0.001). 
The pre-1970 mean from North America 
was also significantly higher than the mean 
for studies from other (p < 0.001), whereas 
the mean for all post-1970 European studies 
fell between the pre- and post-1970 North 
American mean. 

Overall, the data fit these multiple 
regression models approximately equally (all 


Table 4. Distribution of covariates retained in multiple regression models. 3 


Variable 

No. of studies 
(n = 101) 

Mean sperm 
density 6 (10 6 /mL) 

Mean year of 
sample collection 6 

Region 

N orth America 

44 

78 

1970 

Europe 

34 

87 

1982 

Other 

23 

65 

1984 

Included by Carlsen et al. (1) 

Yes 

54 

77 

1974 

No 

47 

68 

1985 

Age 

All flap <40 years of age 

23 

97 

1970 

Some men >40 years of age 

53 

71 

1980 

No information 

25 

88 

1963 

Abstinence time 

Data reported: none <3 days 

14 

81 

1976 

Data reported: some <3 days 

14 

78 

1983 

N o data reported: protocol 

4:9 

80 

1977 

restrictions reported 

No information 

24 

68 

1976 

Prove# fertility 

W ife pregnant arpost-partum 

20 

82 

1968 

At least 90% proven fertility (past) 

31 

67 

1978 

<90% proven fertility (past) 

8 

84 

1979 

N o information 

42 


1984 

M ethod of seme# collection 

M asturbation into container 

M 

S 

1979 

Other or no information 

31 

95 

1971 

M ethod of counting sperm 

M anual 

66 

71 

1982 

N o information 

35 

79 

1971 


*Does not include geographic region, which is shown in Table 2. kJnivariate/unadjusted, weighted by the number of men 
in each study. 
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adjusted R 2 s were between 0.56 and 0.61), 
but not quite as well as the models fit the 
data in our previous analyses (13) (Table 5). 
As in our 1997 analysis (13), when multiple 
regression models that include terms for the 
interaction of geographic region and year are 
used, there is no support for the use of a 
nonlinear model. 

Discussion 

As we stated previously (13), control for con¬ 
founding in these analyses can be only partial 
because of incomplete data. Therefore, it is 
possible that residual confounding remains 
How large is this likely to be? 

0 ne of the strongest confounders in this 
analysis was the type of population studied. 
We examined this factor in two ways: the 
percent of men with proven fertility and the 
type of study population (sperm donor, pre¬ 
vasectomy, etc.) Because these variables were 
highly correlated, we retained only one (the 
percent of men with proven fertility) in the 
final model. When this variable was added to 
the other variables in the multiple regression 
model, it increased the magnitude of the 
slope considerably (37.2%). 

Zavos and Goodpasture (24) reported 
that sperm concentration is higher when 
semen samples are obtained using a collec¬ 
tion deviceduring intercourse than when the 
same subjects collect samples by masturba¬ 
tion (p <0.01), a result that has been report¬ 
ed by others (25). In the current analysis, 
studies that did not require collection by 
masturbation tended to be earlier (mean 
publication year 1970 vs. 1978). Therefore, 
this variable was a strong (positive) con- 
founder; when it was added to the model, 
the magn itude of the slope decreased 34.1%. 

Carisen et al. (1) required that sperm be 
counted by manual methods in all the studies 
that they included in their analysis. 
Nevertheless, because manual counting 
devices have changed somewhat over the 
study period, when reviewing these studies, 
we abstracted information on the specific 
counting method that was used. When the 
particular counting device was not specified, 
we assumed it was manual. N onmanual 
methods are a relatively recent advance and 
are still considered experimental, so that 
studies that use nonmanual methods are like¬ 
ly to specify the use of such methods In 62 
of these 101 studies, the counting device was 
specified to be the hemocytometer, the 
method that has been continually recom¬ 
mended by the World Health Organization 
since 1980 (26,27). T he only other counting 
method that was specified, the M akler cham¬ 
ber (28), was mentioned in only 2 studies of 
101 studies. Thus, we found no evidence 
that the introduction of newer counting 
devices has resulted in lower sperm counts 


In fact, when systematic changes have been 
introduced by newer methods, they tended 
to result in higher counts (14). In any case, 
this variable appeared to have little effect on 
the observed decline in sperm density. 

Some researchers have criticized the use 
of sperm count estimates from early in the 
study period, arguing that greater measure¬ 
ment error was likely in these historical stud¬ 
ies. Greater imprecision in earlier studies 
could not have produced the negative slope 
we observed in Western countries A change 
in the variability of sperm counts would, 
however, violate a basic assumption underly¬ 
ing the regression methods used in these 
analyses, the assumption of constant vari¬ 
ance. Was this assumption justified? To 
answer this question we looked for a trend in 
the standard deviation of sperm density in 
these historical studies. We modeled the 
standard deviation (which was reported in 
34 studies) as a function of year and found 
no evidence of a trend (slope = -0.24; p = 
0.22) (14). We also used a multiple regres¬ 
sion model to examine possible confounding 
of this relationship, but found no evidence 
of this We concluded, therefore, that there 
has been no significant change in the stan¬ 
dard deviation of sperm density over time. 

Geographic region and the interactions of 
region and year were important covariates in 
these analyses. H owever, these geographic 
groupings are large and heterogeneous. For 
example, the category "other countries" 
included Thailand, India, Hong Kong, 
Brazil, Australia, Kuwait, Nigeria, Israel, 


Libya, Tanzania, Peru, Egypt, China, and 
Saudi Arabia. Several studies suggested that 
mean sperm density and trends in semen 
quality may vary considerably, even within 
small areas (29,30), so that it would have 
been desirable to stratify studies into narrow¬ 
er geographic categories if sufficient data had 
been available. Unfortunately, because many 
of these countries contributed only one 
study, it was not possible to use narrower 
geographic strata. 

Abstinence time is known to be 
strongly related to sperm density (31-33). 
In this analysis, when abstinence time was 
added to a linear model that included all 
other variables, the magnitude of the 
slope decreased by 10.6%, suggesting 
moderate confounding. Although the 
inclusion of abstinence time in the model 
appears to have reduced confounding to 
some extent, control for this variable was 
undoubtedly incomplete because less than 
one-third of these studies included report¬ 
ed abstinence times. An additional 49% 
of studies noted that abstinence times 
were restricted by study protocol but, as 
has been demonstrated, these protocols 
are only advisory. Auger et al. (2) noted 
that only 66% of men adhered to the pro¬ 
tocol-specified abstinence time of 3-5 
days. O n the other hand, to account for 
the observed decline in sperm density, 
abstinence time would have had to 
decline appreciably over the study period. 
The evidence for this is not strong; stud¬ 
ies with longer abstinence times (none <3 


Table 5. Comparison of multiple regression models from Swan et al. (13) (n = 56) and the current analysis 
(n = 101). a 


Model 

Adjusted R 2 

Region 

Slope within region 

Interaction 
beta p-Value 

Linear 

0.80 

United States 

-l.SiM.90- -1.10) 

Referent 


Swanetal. (1® 


EuropelAustralis 

-3.13 C4.96--1.30) 

'*$,63 

0.08 



Other countries 

1.56 (-1.00-4.12) 

3.06 

0.03 

Current analysis 

tit 

North America 

- 0,80 (“ 1.37- - 0.24) 

Referent 




Europe 

-2.35 (-3.66--1.05) 

“IIS 

0.03 



Other countries 

~#,21(’230-1.88) 

0:60 

0:56 

Spline 

0.79 

United States <1970 

-1.52 (-2.37--0.66) 

Referent 


Swanetal (13) 


United States >1970 

-1.47 (-3.00-0.06) 

0.04 

IS? 



Europe/Austral® 

-3.12 (-4.99--1.26) 

-1.61 

0.13 



Other countries 

1.56 (-1.03- 4.16) 

lit 

0.04 

Current analysis 

0.60 

North America <1970 

-0.93 (-1.81--0.05) 

Referent 




North America >1970 

“0.55 (-2.00-0.89) 

0,37 

0,71 



Europe 

-2.32 (-3.64--1.00) 

-1,18 

0.09 



Other countries 

-0.25 (-2.37-1.86) 

0.68 

0.52 

Step 

0.72 

United States <1977 

106.7 (910-122.5) 

_ 


Swan et al. (1J) 


United States >1970 

67.7 (55.-9-79.5) 

- 




EuroppfAustralis 

7IJ #0:0:- 90.0) 

- 




Other countries 

583(46:0-707) 

- 


Current tni lysis 

0:17 

North America <1970 

1373 (116 6-159 3) 

_ 




North America >1970 

113.2 (95.9-130.6) 

- 




Europe 

120 1 (103.6-1366) 

- 




Other countries 

104.0 (84.7-123.4) 

- 



“Controlled for proven fertility, abstinence time, age, specimen collection method, study goal and interaction of region and 
study year [Swan et al. (33)]; and proven fertility, abstinence time, age, specimen collection method, method of counting 
sperm, whether study was included by Carlsen et al. (Z); and interaction of region and study year (current analysis). 
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days) were published only slightly earlier 
than those that included some abstinence 
times <3 days (1976 vs 1983). 

After controlling for abstinence time and 
other covariates, the addition of age to the 
model increased the magnitude of the slope 
by only 1.2%. However, we found little evi¬ 
dence that age is an important predictor of 
sperm density. Information was quite incom¬ 
plete for this variable. Twenty-five studies 
contained no information on age, and these 
tended to be older studies (mean publication 
year 1962). For the remaining studies, many 
only included an age range, so that we were 
only able to categorize age into broad cate¬ 
gories Nevertheless, we chose to retain this 
variable in the model for comparability to 
other analyses. 


The current analysis suggests that the 
previously reported trends have continued, 
at least until 1996. We have also shown that 
the studies initially used by C arisen et al. (1) 
did not represent a biased selection of the 
English language literature. Nevertheless, it 
is likely that neither this publication nor fur¬ 
ther statistical analyses of historical data will 
resolve the continuing debate over declining 
sperm counts. Critics will continue to chal¬ 
lenge the reliability of historical data, and 
most will agree that residual confounding, 
which may be appreciable, cannot be com¬ 
pletely eliminated. 

The entire issue of declining sperm 
count has gained in importance because of 
the recognition of several other trends that 
reflect a decline in male reproductive health. 


Appendix 1. Studies not included by Carlsen et al. (I). 


Author (reference) 

Year 

Country 

Sample 

Mean sperm 
density 

Aitden etal m 

1982 

Btfifji Kingdom 

35 

129.30 

Assennato et al. (40) 

1987 

Italy 

18 

85.00* 

BetdiJig (41) 

1934 

United States 

15 

119.00 

Coo per etal. (42) 

1991 

Germany 

25 

94.20 

Cottell and Harrison (43) 

1995 

Ireland 

10 

159.90 

Davidson (44) 

1949 

. ' ..Jilted Kingdom 

15 

143,0ft 

De Castro and M astrorocco (45) 

1984 

Brazil 

501 

66.20 

De Castro et al. (46) 

1990 

Brazil 

1,880 

37,0ft 

DBsghfftyet at. (47) 

1981 

United States 

132 

83.00 

el Shoura etal. (48) 

1995 

Saudi Arabia 

50 

Hill 1 

Eskenazi etal. (49) 

1991 

United States 

48 

87.00 

Fariss et ai. (50) 

1981 

United States 

112 

75.00 j 

Fedder etal. (51) 

1993 

Denmark 

42 

80,00 

Figa-Talamanca et al. (52) 

1996 

Italy 

50 

62.00 

Frick etal. (53) 

1981 

Finland 

13 

13.90 

Glass etal. (54) 

1979 

United States 

22 

61.70 

] elnes (55) 

1988 

Denmark 

68 

1.2ft* 

J ensen et ai. (5# 

1996 

Denmark 

141 

69.20 

Kolonetal (57) 

1992 

Bitted States 

10 

132.80 

Levine et al.(5S) 

1992 

United States 

142 

*74 

M ilby and W horton (53) 

1910 

United States 

90 

93.0Q* 

NnatuetsMSf 

1991 

Nigeria 

m 

64.40 

Noack-Futler etal. (61) 

1993 

Germany 

22 

86.50 

Ratcliff© et al (62) 

1987 

United States 

43 

68.72 

Richardson etal. (63) 

1984 

toted Kingdom 

47 

130.60 

Rogers etal. (64) 

1979 

" United States 

n 

114.00 

Rosenberg et al. (55): 

1985 

United States 

* ft. '' 

99.20 

Saaranen etal, (6fi) 

1987 

Finland 

190 

14117 

Schrader etal. (67) 

1988 

United States 

45 

17.43 

Shaarawyaticj M ahraoud (68) 

1982 

Egypt 

45 

77.50 1 

Sheriff (69) 

1987 


IQ 

51.40 

Sheriff and Legnam (70) 

1992 

Libya 

1,250 

15,08 

Spira (71) 

1984 

France 

52 

92.30 

Stankovic and M ikac-Devic (72) 

1976 

Yugoslavia 

67 

55.26 

Sugkraroeketal. (73) 

1991 

Canada 

it 

128.50 

Venable etal. (74) 

1980 

United States 

63 

11353 

VipoRetal. (75) 

1991 

France 

20 

is® 

Vogt etal. (76) 

1986 

Germany 

239 

62.69 

Wallace etal. (77) 

1993 

United Kingdom 

28 

85.50 

Wang and Yeung (78) 

1980 

Hong Kong 

15 

88.90 

Ward etal, (79) 

1984 

United States 

11 

87.40 

W eidner et al. (80) 

1991 

Germany 

42 

52.00* 

Weyandtetal. (81) 

1996 

United States 

;Si 

38.00 

WicBiifs-etal. (82) 

1983 

Germany 

.25 

84.70 

Wyrobek etal. (83) 

1981 

Bitted States 

26 

66.20 

W yrobek et al. (84) 

1981 

United States 

34 

128.70 

Zhong et si, (15) 

1990 

.Oai 

19 

96.60 


♦Arithmetic mean estimated from median or geometric mean (see text). 


Testicular cancer incidence has increased 
significantly for at least the past 20 years in 
most of the Caucasian populations that have 
been studied (30,34,35). T rends in rates of 
cryptorchidism are consistent with those for 
testicular cancer, for which cryptorchidism is 
a significant risk factor (30). These increases 
in rates of testicular cancer and male genital 
tract abnormalities, like decreasing sperm 
density, have primarily been seen in Western 
countries. Several authors have suggested 
that these trends, together with decreases in 
semen quality, may reflect a more general¬ 
ized increase in testicular dysfunction 
(30,36,37). Although few of these trend 
studies have examined possible causes, com¬ 
mon environmental exposures are plausible. 
If environmental factors have produced 
some, or all, of the temporal changes in 
sperm density, the regional differences that 
have been reported in semen quality, even 
within countries (6,38), may also reflect vari¬ 
ation in these environmental factors. 

Studies that examine differences in semen 
quality between geographically diverse 
cohorts may help identify such factors. An 
ongoing network of international studies, 
begun in 1997, was designed to address this 
question. In these collaborative prospective 
studies, the use of common study protocols, 
analytic methods, and quality control proce¬ 
dures should minimize extraneous interstudy 
differences. These studies should provide 
unbiased estimates of variability among cities 
that have been reported to differ widely in 
semen quality, provide baseline levels of male 
biomarkers for future studies, and generate 
hypotheses of environmental causes of varia¬ 
tion in these parameters 
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